Activist Data Mining for Computational Science: Tools and Applications
نویسنده
چکیده
Classical data mining involves: waiting for data to appear and then mining it. Activist data mining involves: proposing experiments based on algorithmic and application-specific considerations, evaluating the results, proposing new experiments, evaluating, proposing, and so on. Thus Activist Data Mining is a fundamentally interventionist and iterative endeavour. It entails close collaboration with application specialists. The techniques required include combinatorial design to support a disciplined experimental design, a variety of analog circuit-building techniques, and hypothesis-generation. The talk and this paper discusses these tools in the context of a series of case study collaborations with biologists and physicists. The necessary scientific background will be presented to make the discussion self-contained. The talk is meant to appeal to researchers and practitioners in data mining as well as any visiting natural scientists. The data sizes range from 30,000 items for microarrays to trillions of items in gamma ray experiments. My intent in this paper is to convey the philosophy of my appraoch. You can find the technicalities on my web page or on the conference site. I will concentrate on biology because that is where I do
منابع مشابه
Detecting Diseases in Medical Prescriptions Using Data Mining Tools and Combining Techniques
Data about the prevalence of communicable and non-communicable diseases, as one of the most important categories of epidemiological data, is used for interpreting health status of communities. This study aims to calculate the prevalence of outpatient diseases through the characterization of outpatient prescriptions. The data used in this study is collected from 1412 prescriptions for various ty...
متن کاملa swift heuristic algorithm base on data mining approach for the Periodic Vehicle Routing Problem: data mining approach
periodic vehicle routing problem focuses on establishing a plan of visits to clients over a given time horizon so as to satisfy some service level while optimizing the routes used in each time period. This paper presents a new effective heuristic algorithm based on data mining tools for periodic vehicle routing problem (PVRP). The related results of proposed algorithm are compared with the resu...
متن کاملDetecting Diseases in Medical Prescriptions Using Data Mining Tools and Combining Techniques
Data about the prevalence of communicable and non-communicable diseases, as one of the most important categories of epidemiological data, is used for interpreting health status of communities. This study aims to calculate the prevalence of outpatient diseases through the characterization of outpatient prescriptions. The data used in this study is collected from 1412 prescriptions for various ty...
متن کاملSports Result Prediction Based on Machine Learning and Computational Intelligence Approaches: A Survey
In the current world, sports produce considerable statistical information about each player, team, games, and seasons. Traditional sports science believed science to be owned by experts, coaches, team managers, and analyzers. However, sports organizations have recently realized the abundant science available in their data and sought to take advantage of that science through the use of data mini...
متن کاملAnalyzing and Investigating the Use of Electronic Payment Tools in Iran using Data Mining Techniques
In today's world, most financial transactions are carried out using done through electronic instruments and in the context of the Information Technology and Internet. Disregarding the application of new technologies at this field and sufficing to traditional ways, will result in financial loss and customer dissatisfaction. The aim of the present study is surveying and analyzing the use of elect...
متن کامل